Coding Speech at Very Low R and Temporal Deco
نویسندگان
چکیده
This paper presents a new method for speech coding at rates around 1.2 kbps based on STRAIGHT, a high quality speech analysis-synthesis method. For encoding spectral information, Modified Restricted Temporal Decomposition (MRTD) based vector quantization is used, where MRTD is a method of temporal decomposition for line spectral frequency parameters. Meanwhile, pitch and gain parameters are coded using linear and spline interpolation, respectively. Subjective test results indicate that the performance of the proposed speech coding method is close to that of the 4.8 kbps US Federal Standard (FS-1016) CELP coder.
منابع مشابه
Very low rate speech coding using temporal decomposition and waveform interpolation
In very low rate coding the aim is to accurately represent speech characteristics as efficiently as possible. High coding gains for the spectral features can be achieved through the use of temporal decomposition. Waveform interpolation coders accurately represent the excitation using characteristic waveforms (CWs) extracted at a constant rate. In this paper, the two approaches are combined into...
متن کاملEffects of ageing on speed and temporal resolution of speech stimuli in older adults
Background: According to previous studies, most of the speech recognition disorders in older adults are the results of deficits in audibility and auditory temporal resolution. In this paper, the effect of ageing on timecompressed speech and auditory temporal resolution by word recognition in continuous and interrupted noise was studied. Methods: A time-compressed speech test (TCST) w...
متن کاملComparative study of different parameters for temporal decomposition based speech coding
Temporal decomposition (TD) is an e ective technique to compress the spectral information of speech through orthogonalization of the matrix of spectral parameters leading to an e cient rate reduction in speech coding applications. The performance of TD is function of the parameters used. Although \decomposition suitability" of a parameter set is typically de ned on the basis of \phonetic releva...
متن کاملA new approach to modeling excitation in very low-rate speech coding
A new method for two-band approximation of excitation signals in an LPC model, to improve speech naturalness in very low rate coding, is proposed. Based on a simpli ed model of Multi-Band Excitation, the method accurately determines the degree of periodicity, using the concept of Instantaneous Frequency (IF) estimation in frequency domain. The harmonic structure in the spectrum of LPC residual,...
متن کاملEfficient sub-optimal temporal decomposition with dynamic weighting of speech signals for coding applications
The Optimized Temporal Decomposition (OTD) technique for Line Spectral Frequencies (LSF) speech envelope representation, under a MMSE criterion, has been shown to be promising for very low bit rate speech coding for storage and broadcast applications. In order to improve perceptual speech quality, a dynamically weighted OTD (DW-OTD) technique is introduced in this work. It extends the OTD by al...
متن کامل